GeneComber: Combining Outputs of Gene Prediction Programs for Improved Results

نویسندگان

  • Sohrab P. Shah
  • Graham P. McVicker
  • Alan K. Mackworth
  • Sanja Rogic
  • B. F. Francis Ouellette
چکیده

UNLABELLED We recently demonstrated that combining the output from Genscan and HMMgene can provide increased accuracy of gene predictions. We have created a robust software system that runs algorithms previously described on DNA sequences and provides a public web interface to the system for use by the biological community worldwide. The GeneComber system performs ab initio gene prediction by first taking a user inputted DNA sequence and running Genscan and HMMgene. The outputs of Genscan and HMMgene are then integrated using the EUI, GI and EUI_frame algorithms. All results are then stored into a relational database management system (RDBMS) and can then be retrieved through a web interface. The web interface provides a unified view of the GeneComber predictions by graphically overlaying outputs from Genscan, HMMgene, EUI, GI and EUI_frame. Outputs can also be retrieved in general feature format (GFF) or FASTA format. The software is written in the Perl programming language and is both dependent on and interoperable with the Bioperl toolkit. It includes high-level application programming interfaces (APIs) to run Genscan, HMMgene and a database API to insert prediction results into an RDBMS. The APIs are assembled into the genecomber script which is executed by the web interface or can be run directly from the Unix command line. The web interface is written in PHP and is structured so as to be easily modified for viewing data from any database that stores gene structures. AVAILABILITY The GeneComber public web interface and supplementary information is located at http://bioinformatics.ubc.ca/genecomber The source code is released under the GNU General Public License and is available at ftp://ftp.bioinformatics.ubc.ca/pub/genecomber/software.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Alpha-cut Fuzzy C-means, Fuzzy ARTMAP and Cox Regression Model for Customer Churn Prediction

As customers are the main asset of any organization, customer churn management is becoming a major task for organizations to retain their valuable customers. In the previous studies, the applicability and efficiency of hierarchical data mining techniques for churn prediction by combining two or more techniques have been proved to provide better performances than many single techniques over a nu...

متن کامل

Study on Combining Ability and Gene Effects Estimation in Some Sweet Corn Inbred Lines (Zea mays L. var saccarata) by Line × Tester Method

In breeding programs determination of gene effects and general and specific combining ability for screening of test crosses is necessary. In order to estimate the genetic variance components and the general and specific combining ability of sweet corn lines, an experiment was conducted using 8 sweet corn S6 inbred lines (including 4 maternal and 4 paternal lines) by line × tester mating design ...

متن کامل

Improving gene recognition accuracy by combining predictions from two gene-finding programs

MOTIVATION Despite constant improvements in prediction accuracy, gene-finding programs are still unable to provide automatic gene discovery with desired correctness. The current programs can identify up to 75% of exons correctly and less than 50% of predicted gene structures correspond to actual genes. New approaches to computational gene-finding are clearly needed. RESULTS In this paper we h...

متن کامل

Operon prediction in Pyrococcus furiosus

Identification of operons in the hyperthermophilic archaeon Pyrococcus furiosus represents an important step to understanding the regulatory mechanisms that enable the organism to adapt and thrive in extreme environments. We have predicted operons in P.furiosus by combining the results from three existing algorithms using a neural network (NN). These algorithms use intergenic distances, phyloge...

متن کامل

Estimation of Combining Ability and Gene Action for Agro-Morphological Characters of Rapeseed (Brassica Napus L.) Using Line×Tester Mating Design

Combining ability effects were estimated for different agronomic characters in line × tester crossing program comprising 21 hybrids produced by crossing 7 lines and 3 testers. Parents and hybrids differed significantly for general combining ability (GCA) and specific combining ability (SCA) effects, respectively. The variance due to GCA and SCA showed that gene action was predominantly additive...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 19 10  شماره 

صفحات  -

تاریخ انتشار 2003